CLAIMS 



What is claimed is: 

1 . A method comprising: 

receiving a first utterance from an intended talker at an integrated 

speech and speaker recognition system; 
generating a voice characteristic model for the intended talker; 
receiving a second utterance from the intended talker in a noisy 

environment at the speaker recognition system; 
processing a portion of speech associated with the second utterance, 

wherein processing comprises, 

computing a speaker verification score with the voice characteristic 
model associated with the portion of speech, 

computing a speech recognition score associated with the portion 
of speech, and 

generating a combined score by combining the speaker verification 
score and the speech recognition score; and 
selecting a best hypothesis associated with the second utterance and 
based upon the combined score. 

2. The method of claim 1 , wherein the portion of speech includes a word, a 
sentence, a syllable, or a frame. 
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3. The method of claim 1 , wherein processing further comprises altering a 
search path in a Viterbi search used by a speech recognizer. 

4. The method of claim 1 , wherein identifying a intended talker comprises using 
hotword speech recognition to identify the intended talker. 

5. The method of claim 1 , wherein the noisy environment includes additional 
speakers, music, stationary and non-stationary noise. 

6. The method of claim 1 , wherein the voice characteristic model includes a 
voice print, personal profile and linguistic characteristics. 

7. A system comprising: 

a speech system; and 

a speech input device connected to the speech system; wherein the 
speech system comprises, 

a voice server, wherein the server includes an integrated speech 
and speaker recognizer that, 

receives a first utterance from an intended talker via the speech 
input device; 

creates a voice characteristic model for the intended talker; 
receives a second utterance from the intended talker via the 
speech input device in a noisy environment; 
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12 processes a portion of speech associated with the second 

13 utterance, wherein the processor 

14 computes a speaker verification score with the voice characteristic 

15 model associated with the portion of speech, 

16 computes a speech recognition score associated with the portion of 

17 speech, and 

18 generates a combined score by combining the spealcer verification 

19 score and the speech recognition score; and 

20 selects a best hypothesis associated with the second utterance and 



p 21 based upon the combined score. 



1 8. The system of claim 7, wherein the speech input device comprises a 



2 cellular telephone, an analog telephone, a digital telephone, and a voice 



1^ 3 over internet protocol device. 

m 



fj 1 9. The system of claim 7, wherein the portion of speech includes a word, a 
2 sentence, a syllable, or a frame. 



1 1 0. The system of claim 7, wherein the server is further configured to alter a 

2 search path in a Viterbi search used by a speech recognizer. 

1 11. An integrated speech and speaker recognition system comprising: 

2 means for receiving a first utterance from the intended talker; 
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3 means for generating a voice characteristic model for the Intended 

4 talker; 

5 means for receiving a second utterance from the intended talker in a 

6 noisy environment at the speaker recognition system; 

7 means for processing a portion of speech associated with the second 

8 utterance, wherein processing comprises, 

9 means for computing a speaker verification score with the voice 

10 characteristic model associated with the portion of speech, 

11 means for computing a speech recognition score associated with 
Q 12 the portion of speech, and 

b 

IB 13 means for generating a combined score by combining the speaker 

« s 
saw 

1^ 14 verification score and the speech recognition score; and 

15 means for selecting a best hypothesis associated with the second 

9 16 utterance and based upon the combined score. 

fU 

pj 1 12. The system of claim 1 1 , wherein the portion of speech Includes a word, a 

2 sentence, a syllable, or a frame. 

1 13. The system of claim 1 1 , wherein the means for processing further 

2 comprises means for altering a search path in a Viterbi search used by a 

3 speech recognizer on the second utterance. 
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The system of claim 1 1 , wherein the means for identifying an intended 
tall^er comprises means for using hotword speech recognition to identify 
the intended talker. 

The system of claim 1 1 , wherein the noisy environment includes 
additional speakers, music, stationary and non-stationary noise. 

The system of claim 1 1 , wherein the voice characteristic model includes a 
voice print, personal profile and linguistic characteristics. 

A machine-readable medium having stored thereon a plurality of 
instructions, said plurality of instructions when executed by a machine, 
cause said machine to perform a process comprising: 

receiving a first utterance from the intended talker at an integrated 

speech and speaker recognition system; 
generating a voice characteristic model for the intended talker; 
receiving a second utterance from the intended talker in a noisy 

environment at the speaker recognition system; 
processing a portion of speech associated with the second utterance, 
wherein processing comprises, 

computing a speaker verification score with the voice characteristic 

model associated with the portion of speech, 
computing a speech recognition score associated with the portion 

of speech, and 



17 



15 generating a combined score by combining tlie speaker verification 

16 score and the speech recognition score; and 

17 selecting a best hypothesis associated with the second utterance and 

18 based upon the combined score. 

1 1 8. The machine-readable medium of claim 1 7 wherein the portion of speech 

2 includes a word, a sentence, a syllable, or a frame. 

1 1 9. The machine-readable medium of claim 1 7, having stored thereon 

2 additional instructions when processing a portion of speech, said 

3 additional instructions when executed by a machine, cause said machine 

4 to perform altering a search path in a Vlterbi search used by a speech 

5 recognizer. 

1 20. The machine-readable medium of claim 17, having stored thereon 

2 additional instructions when identifying a Intended talker, said additional 

3 instructions when executed by a machine, cause said machine to perfomri 

4 using hotword speech recognition to identify the intended talker. 
1 
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21 . The machine-readable medium of claim 17, wherein the noisy 
environment includes additional speakers, music, stationary and non- 
stationary noise. 

22. The machine-readable medium of claim 17, wherein the voice 
characteristic model includes a voice print, personal profile and linguistic 
characteristics. 

23. A method comprising: 

receiving an utterance from an intended talker at a speech recognition 
system; 

computing a speaker verification score with a voice characteristic 

model associated and with the utterance; 
computing a speech recognition score associated with the utterance; 

and 

selecting a best hypothesis associated with the utterance and based 
on both the speaker verification score and the speech recognition 
score. 

24. The method of claim 23, wherein the voice characteristic model is 
obtained from a voice model database. 

25. The method of claim 23, wherein the voice characteristic model is 
obtained from a first portion of the utterance. 
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1 26. A speech recognition system comprising: 

2 a spealcer verifier; 

3 a speecfi recognizer connected to the speaker verifier; and 

4 an input device connected to the speaker verifier and speech recognizer, 

5 wherein the input device receives an utterance from an intended talker; and 

6 wherein the speech recognizer generates a recognition score associated with the 

7 utterance, the speaker verifier generates a speaker verification score associated 

8 with the utterance; and the recognition score is combined with the verification 

9 score to select a best hypothesis of the utterance. 

1 27. The speech recognition system of claim 26, wherein the speech 

2 recognizer and speaker verifier are software entities residing on a speech 

3 server, and wherein the speech server comprises a processor, a bus 

4 connected to the processor, and memory connected to the bus that stores 

5 the software entities. 

1 28. The speech recognition system of claim 27, further comprising a database 

2 connected to the speech server, wherein the database stores a voice 

3 characteristic model of the intended talker. 
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